Program Optimisations via Hylomorphisms for Extraction of Executable Code
drops.dagstuhl.deยท23hยท
Discuss: Hacker News
๐Ÿ‘‘Coq Tactics
How many valid JSON strings are there?
qntm.orgยท6h
โœ…Format Verification
User Defined Types and Custom Metadata in DataFusion
datafusion.apache.orgยท11hยท
Discuss: Hacker News
๐Ÿ“‹DFDL
OpenDataLoader-PDF: An open source tool for structured PDF parsing
github.comยท14hยท
Discuss: Hacker News
๐Ÿ“„PDF Internals
Why Tables Are the Hardest Problem in Document AI
runpulse.comยท1dยท
Discuss: Hacker News
๐Ÿ“„Document AI
The Database Zoo: SQL, NoSQL, and the Rise of Specialized Engines
hackernoon.comยท1d
๐ŸบDatabase Archaeology
Scaling Speculative Decoding with Lookahead Reasoning
hao-ai-lab.github.ioยท1d
๐Ÿ”งReed-Solomon Decoders
Multi-Hierarchical Feature Detection for Large Language Model Generated Text
arxiv.orgยท18m
๐Ÿค–Advanced OCR
From Chaos to Clarity: Leveraging Pydantic for Smarter AI
dev.toยท23hยท
Discuss: DEV
โœ…Archive Validation
Things in the "Context Plane" โ€“ By Shagility
agiledata.substack.comยท7hยท
Discuss: Substack
๐Ÿ”—Data Provenance
Codifying Natural Langauge Tasks
arxiv.orgยท1d
๐Ÿ“‹Document Grammar
Show HN: Embedding Explorer โ€“ compare text embedding models in your browser
github.comยท10hยท
Discuss: Hacker News
๐ŸŒ€Brotli Dictionary
From Documents to Database: Failure Modes for Industrial Assets
arxiv.orgยท1d
๐Ÿ“šDocumentation Archaeology
LLM Features That Ship: Extraction, Generation, and Classification
alex-jacobs.comยท12hยท
Discuss: Hacker News
๐ŸŒ€Brotli Internals
Open Political Corpora: Structuring, Searching, and Analyzing Political Text Collections with PoliCorp
arxiv.orgยท1d
๐Ÿ“„Semantic Chunking
Compositional Interface Refinement Through Subtyping in Probabilistic Session Types
arxiv.orgยท1d
๐Ÿ“žSession Types